Description

Technical Field 

[0001] The present invention relates to cytochrome C that constitutes a glucose dehydrogenase β subunit, to a DNA
encoding the cytochrome C, and to their utilization. The glucose dehydrogenase is useful for a glucose sensor using
an enzyme electrode or the like.

Background Art 

[0002] Biosensors using an enzyme that specifically reacts with a particular substrate are being actively developed
in various industrial fields. For the glucose sensor in particular, which is one such biosensor, measurement methods
and devices utilizing such methods are being actively developed, mainly in medical fields. The glucose
sensor has a history of about 40 years, since Clark and Lyons first reported a biosensor combining glucose oxidase
and an oxygen electrode in 1962 (Clark, L. C., Jr. and Lyons, C. "Electrode systems for continuous
monitoring in cardiovascular surgery." Ann. N. Y. Acad. Sci., 105: 20-45).

[0003] Thus, the adoption of glucose oxidase as an enzyme for the glucose sensor has a long history. This is because
glucose oxidase shows high substrate specificity for glucose and superior thermal stability, can be
produced on a large scale, and costs less to produce than other enzymes. The high substrate specificity
means that this enzyme does not react with saccharides other than glucose, which leads to the advantage that
accurate measurement can be achieved without error in the measured values. Further, the superior thermal stability
means that problems concerning denaturation of the enzyme and inactivation of its enzymatic activity due to heat can
be prevented, which leads to the advantage that accurate measurement can be performed over a long period of time.
[0004] However, although glucose oxidase has the advantages described above, it has the problem that the enzyme
is affected by dissolved oxygen, which in turn affects measurement results.

[0005] Meanwhile, in addition to glucose oxidase, a glucose sensor utilizing glucose dehydrogenase (hereinafter
referred to as "glucose dehydrogenase" or "GDH") has also been developed. This enzyme is also found in microorganisms.
For example, there are known glucose dehydrogenase derived from Bacillus (EC 1.1.1.47) and glucose
dehydrogenase derived from Cryptococcus (EC 1.1.1.119).

[0006] The former glucose dehydrogenase (EC 1.1.1.47) is an enzyme that catalyzes the reaction β-D-glucose +
NAD(P)⁺ → D-δ-gluconolactone + NAD(P)H + H⁺, and the latter glucose dehydrogenase (EC 1.1.1.119) is an enzyme
that catalyzes the reaction D-glucose + NADP⁺ → D-δ-gluconolactone + NADPH + H⁺. The aforementioned glucose
dehydrogenases derived from microorganisms are already marketed.

[0007] These glucose dehydrogenases have the advantage that they are not affected by dissolved oxygen in a measurement
sample. As a result, accurate measurement can be achieved without causing errors in
measurement results even when the measurement is performed in an environment in which the oxygen partial pressure
is low, or when a high-concentration sample requiring a large amount of oxygen is used for the measurement.
[0008] However, although conventional glucose dehydrogenase is not affected by dissolved oxygen, it has the problems
of poor thermal stability and substrate specificity poorer than that of glucose oxidase. For use in a sensor,
an enzyme that overcomes the disadvantages of both glucose oxidase and glucose dehydrogenase has been
desired.

[0009] The inventors of the present invention reported results of their studies on GDH using samples collected
from soil near hot springs in Sode, K., Tsugawa, W., Yamazaki, T., Watanabe, M., Ogasawara, N., and Tanaka, M., Enzyme
Microb. Technol., 19, 82-85 (1996); Yamazaki, T., Tsugawa, W. and Sode, K., Appl. Biochem. and Biotec., 77-79/0325
(1999); and Yamazaki, T., Tsugawa, W. and Sode, K., Biotec. Lett., 21, 199-202 (1999). The microorganisms in those
samples produce a coenzyme-binding GDH, and its enzymological properties such as optimum reaction temperature,
thermal stability, and substrate specificity have already been clarified (see the aforementioned documents). This enzyme
is a hetero-oligomeric enzyme constituted by a catalytic subunit having high thermal resistance (α subunit),
an electron-transferring subunit (β subunit), and a γ subunit of unknown function, and its activity peaks
are observed at 45°C and 75°C. Further, the γ and α subunit genes have been cloned, it has been
clarified that the aforementioned microorganism belongs to Burkholderia cepacia, and the N-terminal amino acid sequence
of the β subunit has been clarified (Ken Inose, Tokyo Agricultural Engineering University Master's Thesis
(2001)). However, the structure of the β subunit gene has not been reported.

Disclosure of the Invention

[0010] An object of the present invention is to provide a DNA encoding a GDH β subunit of a microorganism belonging
to the genus Burkholderia and a method of using the DNA.
[0011] The inventors of the present invention have further advanced the study of GDH of the Burkholderia cepacia KS1
strain and succeeded in isolating a DNA encoding the GDH β subunit, thereby completing the present invention.
[0012] That is, the present invention can be described as follows.

(1) A protein defined in the following (A) or (B):

(A) a protein which has at least the amino acid sequence comprising amino acids 23 to 425 of SEQ ID NO: 16; 

(B) a protein which has at least the amino acid sequence comprising amino acids 23 to 425 of SEQ ID NO: 
16 including substitution, deletion, insertion or addition of 1 to 20 amino acid residues. 

(2) A DNA encoding a protein defined in the following (A) or (B):

(A) a protein which has at least the amino acid sequence consisting of amino acids 23 to 425 of SEQ ID NO: 16;

(B) a protein which has at least the amino acid sequence comprising amino acids 23 to 425 of SEQ ID NO:
16 including substitution, deletion, insertion or addition of 1 to 20 amino acid residues.

(3) The DNA according to item (2), in which the DNA is defined in the following (a) or (b):

(a) a DNA including the nucleotide sequence consisting of nucleotides 187 to 1398 of SEQ ID NO: 15;

(b) a DNA which is hybridizable with the nucleotide sequence consisting of nucleotides 187 to 1398 of SEQ
ID NO: 15 under stringent conditions.

(4) The DNA according to item (3), further including the nucleotide sequence consisting of nucleotides 121 to 187
of SEQ ID NO: 15.

(5) A recombinant vector including a DNA according to any one of items (2) to (4).

(6) A transformant transformed with a DNA according to any one of items (2) to (4) or the recombinant vector 
according to item (5). 

(7) A method of producing a glucose dehydrogenase β subunit, including culturing the transformant according to
item (6) to produce a glucose dehydrogenase β subunit as an expression product of the DNA, and collecting the
produced β subunit.

(8) The DNA according to item (3) or (4), further including a nucleotide sequence encoding an α subunit and a
γ subunit of glucose dehydrogenase of Burkholderia cepacia.

(9) A recombinant vector including the DNA according to item (8). 

(10) A transformant transformed with the DNA according to item (8) or the recombinant vector according to item (9).

(11) A method of producing a glucose dehydrogenase complex, including culturing the transformant according to
item (10) to produce a glucose dehydrogenase complex as an expression product of the DNA, and collecting the
produced complex.

Best Mode for Carrying out the Invention 

[0013] Hereinafter, the present invention will be described in detail. 

[0014] The inventors of the present invention have searched for and isolated a DNA encoding the GDH β subunit of the
Burkholderia cepacia KS1 strain. The aforementioned strain was deposited at the International Patent Organism Depositary,
National Institute of Advanced Industrial Science and Technology (Central 6, 1-1-1, Higashi 1-chome, Tsukuba-shi,
Ibaraki-ken, Japan, postal code: 305-8566) on September 25, 2000 and received the microorganism accession
number FERM BP-7306. In the present specification, the DNA encoding the GDH β subunit is sometimes referred
to as the DNA of the present invention, the "β subunit structural gene", or simply the "β subunit gene".
[0015] The inventors of the present invention have confirmed that GDH produced by the Burkholderia cepacia KS1 strain
is a polymeric protein containing an α subunit, a β subunit, and a γ subunit. The protein of the present invention corresponds
to the β subunit among these subunits. Spectrophotometric analyses of GDH indicate that the absorption
wavelength of oxidized GDH resembles the absorption wavelengths of the alcohol dehydrogenase and aldehyde dehydrogenase
of Gluconobacter sp. and Acetobacter sp., which are composed of a dehydrogenase-cytochrome complex,
and that this absorption is lost upon heat treatment. This fact, together with the difference in optimum reaction
temperature of the GDH between the presence and absence of the β subunit as described below, suggests that the
β subunit is composed of cytochrome C.

[0016] Physical and chemical properties of the above GDH are shown below. 

(1) Function: the enzyme catalyzes dehydrogenation reaction of glucose. 
(2) The enzyme consists of subunits showing a molecular weight of about 60 kDa and a molecular weight of about
43 kDa in SDS-polyacrylamide gel electrophoresis under a reducing condition.

(3) The enzyme shows a molecular weight of about 380 kDa in gel filtration chromatography using TSK Gel 
G3000SW (Manufactured by Tosoh Corporation). 

(4) Optimal reaction temperature: around 45°C (Tris-HCl buffer, pH 8.0).

Physical and chemical properties of the α-subunit alone are shown below.

(1') The protein has a glucose dehydrogenase activity.

(2') The protein shows a molecular weight of about 60 kDa in SDS-polyacrylamide gel electrophoresis under
a reducing condition.

(3') Optimal reaction temperature: around 75°C (Tris-HCl buffer, pH 8.0).

[0017] The β subunit can be obtained together with the other subunits from a culture of the Burkholderia cepacia KS1 strain
by purifying the GDH complex using GDH activity as an index. The GDH activity can be measured in the same manner
as in known GDH activity measurements. Specifically, the measurement can be performed as follows. A 10 mM
potassium phosphate buffer (pH 7.0) containing 594 µM 1-methoxyphenazine methosulfate (mPMS) and 5.94 µM
2,6-dichlorophenol indophenol (DCIP) is added to an enzyme sample and glucose as a substrate, and the mixture
is incubated at 37°C. The change in the absorbance of DCIP at 600 nm is traced using a spectrophotometer, and the
rate of decrease in the absorbance is defined as the enzyme reaction rate.
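
The rate defined in the assay above can be estimated from a time series of absorbance readings. The following is a minimal sketch only: it assumes that the readings fall in the linear part of the reaction and fits a least-squares line to them. The function name and the sample readings are hypothetical and are not taken from the specification.

```python
def absorbance_decrease_rate(times_min, a600):
    """Return the rate of decrease of A600 (absorbance units per minute).

    A straight line is fitted to the readings by ordinary least squares and
    the sign of the slope is flipped, so that a falling absorbance gives a
    positive rate.
    """
    n = len(times_min)
    if n != len(a600) or n < 2:
        raise ValueError("need at least two paired time/absorbance readings")
    mean_t = sum(times_min) / n
    mean_a = sum(a600) / n
    sxx = sum((t - mean_t) ** 2 for t in times_min)
    if sxx == 0:
        raise ValueError("time points must not all be identical")
    sxy = sum((t - mean_t) * (a - mean_a) for t, a in zip(times_min, a600))
    return -sxy / sxx


# Hypothetical readings: time in minutes, absorbance of DCIP at 600 nm.
times = [0.0, 0.5, 1.0, 1.5, 2.0]
readings = [0.100, 0.090, 0.080, 0.070, 0.060]
rate = absorbance_decrease_rate(times, readings)
print(f"{rate:.4f} absorbance units per minute")
```

Converting this rate into enzyme units would additionally require the molar absorption coefficient of DCIP under the assay conditions, which this section does not state.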

[0018] In addition, since the nucleotide sequence of the gene encoding the β subunit (SEQ ID NO: 15) has been
determined by the present invention, the β subunit can also be produced by expressing, in a suitable host, a DNA
having this nucleotide sequence or a DNA encoding the same amino acid sequence as that encoded by this DNA.
The amino acid sequence that can be encoded by the open reading frame (ORF) of SEQ ID NO: 15 is
shown in SEQ ID NO: 16. The N-terminal amino acid sequence of the β subunit determined from the protein was
identical to amino acids 23 to 38 of SEQ ID NO: 16. Therefore, amino acids 1 to 22 are presumed to be a signal
peptide. Note that although amino acid residue 1 is described as Val in SEQ ID NOs: 15 and 16, it is highly likely
to be Met, and it may also be removed after translation.
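
The coordinates quoted in this specification can be cross-checked with simple codon arithmetic. The sketch below assumes that codon 1 of the ORF begins at nucleotide 121 of SEQ ID NO: 15; that start position is inferred here from the range "nucleotides 121 to 187" in item (4) and is not stated explicitly. The function name is hypothetical.

```python
ORF_START = 121  # assumed first nucleotide of codon 1 in SEQ ID NO: 15


def codon_span(aa_position, orf_start=ORF_START):
    """Return the (first, last) nucleotide numbers of the codon for a residue.

    Both amino acid and nucleotide positions are 1-based, as in the
    sequence listing.
    """
    if aa_position < 1:
        raise ValueError("amino acid positions are 1-based")
    first = orf_start + 3 * (aa_position - 1)
    return first, first + 2


signal_peptide_start = codon_span(1)   # first codon of the presumed signal peptide
mature_start = codon_span(23)          # first residue of the mature beta subunit
mature_end = codon_span(425)           # last residue of SEQ ID NO: 16
after_last_residue = codon_span(426)   # codon following residue 425

print(signal_peptide_start, mature_start, mature_end, after_last_residue)
```

Under this assumption, residue 23 starts at nucleotide 187 and the codon after residue 425 ends at nucleotide 1398, which agrees with the range "nucleotides 187 to 1398" in item (3) if that range is read as including a stop codon.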

[0019] A BLAST homology search on the aforementioned amino acid sequence indicated high
overall homologies: 65% homology with a cytochrome C subunit of oxidoreductase dehydrogenase derived from Ralstonia
solanacearum, 48% homology with a cytochrome C subunit of sorbitol dehydrogenase derived from Gluconobacter
oxydans, 44% homology with a cytochrome C subunit of gluconic acid dehydrogenase derived from Erwinia cypripedii,
and 55.7% homology at the nucleotide sequence level and 46.4% homology at the amino acid level with a cytochrome
C subunit of 2-keto-gluconic acid dehydrogenase derived from Pantoea citrea. Furthermore, the amino acid sequences
of these cytochromes C conserved a heme-binding motif (SEQ ID NO: 18). These facts show that the β subunit of the
present invention is cytochrome C.
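
A motif check of the kind mentioned above can be sketched as a pattern search over a protein sequence. The motif of SEQ ID NO: 18 is not reproduced in this section, so the sketch assumes the canonical c-type cytochrome heme-binding pattern Cys-X-X-Cys-His (CXXCH); the function name and the toy sequence are hypothetical and are not part of SEQ ID NO: 16.

```python
import re

# Assumption: canonical c-type cytochrome motif C-X-X-C-H, where X is any residue.
# The lookahead lets overlapping occurrences be reported as well.
HEME_MOTIF = re.compile(r"(?=(C..CH))")


def find_heme_motifs(protein_seq):
    """Return (1-based start position, matched residues) for each CXXCH hit."""
    seq = protein_seq.upper()
    return [(m.start() + 1, m.group(1)) for m in HEME_MOTIF.finditer(seq)]


# Toy sequence for illustration only.
toy_sequence = "MKTLCAACHGNSAPCQSCHAA"
hits = find_heme_motifs(toy_sequence)
print(hits)
```

Applied to the real sequences, a search like this would only show whether the assumed pattern is present; it does not replace the alignment-based comparison described in the text.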

[0020] The β subunit of the present invention may be a protein having the amino acid sequence consisting of
amino acids 23 to 425 of SEQ ID NO: 16 including substitution, deletion, insertion, or addition of 1 to 20, preferably 1
to 10, more preferably 1 to 5 amino acid residues in the amino acid sequence, so long as it can function as a GDH β
subunit. Note that the term "functions as a GDH β subunit" means "functions as cytochrome C without impairing
the enzyme activity of the GDH".

[0021] The DNA of the present invention is a DNA that encodes the aforementioned β subunit and can be obtained
from, for example, the Burkholderia cepacia KS1 strain. The DNA of the present invention was isolated from the
chromosomal DNA of the Burkholderia cepacia KS1 strain in the course of completing the present invention. The DNA
of the present invention can be obtained, for example, by PCR using primers having the nucleotide sequences of SEQ ID
NOs: 13 and 14 and the chromosomal DNA of the Burkholderia cepacia KS1 strain as a template. In addition, since the
nucleotide sequence of the DNA of the present invention and the amino acid sequence encoded by the nucleotide
sequence have been clarified by the present invention, the DNA of the present invention can also be obtained by
chemical synthesis based on these sequences. Furthermore, the DNA of the present invention can be
obtained from the chromosomal DNA of the Burkholderia cepacia KS1 strain by hybridization using an oligonucleotide
prepared based on the aforementioned sequences as a probe. Similarly, variants can be obtained from strains other
than Burkholderia cepacia.

[0022] The DNA of the present invention may be one encoding a protein having the amino acid sequence consisting
of amino acids 23 to 425 of SEQ ID NO: 16, or one encoding a protein that has this amino acid sequence including
substitution, deletion, insertion, or addition of 1 to 20, preferably 1 to 10, more preferably 1 to 5 amino acid residues
and that functions as a GDH β subunit.

[0023] Specifically, the DNA of the present invention includes a DNA having the nucleotide sequence consisting of
nucleotides 187 to 1398 of SEQ ID NO: 15. Further, the DNA of the present invention may be a DNA that hybridizes
with SEQ ID NO: 15, or with a probe that can be prepared from this sequence, under stringent conditions and encodes a
protein that can function as a β subunit. The stringent conditions include conditions under which DNAs having
70% or more, preferably 80% or more, more preferably 90% or more homology to each other hybridize, specifically
conditions of 1 x SSC, 0.1% SDS, and 60°C.

[0024] The GDH β subunit can be produced by culturing a transformant that harbors the DNA of the present invention
or a recombinant vector containing the DNA of the present invention to produce the GDH β subunit as an expression
product of the DNA, and by collecting the GDH β subunit from the microorganism cells or the culture medium. In this
case, the DNA encoding the GDH β subunit of the present invention may be expressed together with a DNA encoding
an α subunit, and further with a DNA encoding a γ subunit, to produce a GDH complex. A DNA fragment that sequentially
encodes the γ subunit and the α subunit can be obtained by PCR using primers having the nucleotide sequences of SEQ
ID NOs: 18 and 19.

[0025] Examples of the microorganism that produces the GDH β subunit or the GDH complex include: enterobacteria
including Escherichia coli; Gram-negative bacteria such as Pseudomonas and Gluconobacter; Gram-positive bacteria
including bacteria belonging to the genus Bacillus such as Bacillus subtilis; yeasts such as Saccharomyces cerevisiae;
and molds such as Aspergillus niger. However, the microorganism is not limited to these, and any microorganism may
be used so long as it is a host microorganism suitable for the production of foreign proteins.

[0026] Vectors used for cloning or expressing the DNA of the present invention are suitably those constructed
for gene recombination from plasmids or phages that can autonomously replicate in host microorganisms. Examples
of vectors for E. coli include pBR322, pUC18, pUC118, pUC19, pUC119, pTrc99A, pBluescript, and SuperCosI, which
is a cosmid. Transfer of the DNA from the vector used in cloning the DNA of the present invention to
other recombinant vectors suitable for expression, etc. can be readily performed by recovering the DNA of the present
invention from a recombinant vector containing it with a restriction enzyme or by the
PCR method and ligating it with a vector fragment. Furthermore, transformation of microorganisms with these vectors
can be performed by the competent cell method with calcium treatment for bacteria belonging to the genus Escherichia,
the protoplast method for bacteria belonging to the genus Bacillus, the KU method or the KUR method for
yeasts, the micromanipulation method for molds, and so forth. In addition, the electroporation method can also be
used widely.

[0027] Host microorganisms into which the target recombinant vector has been introduced may be selected
by using, for example, a drug resistance marker of the vector containing the target DNA.
For example, a microorganism that can grow in a selective medium based on the drug resistance marker
and produces GDH may be selected.

[0028] As for the culture method of the transformant, culture conditions may be selected by considering nutritional 
and physiological properties of the host. In many cases, liquid culture is performed. It is industrially advantageous to 
perform aeration culture with shaking. 

[0029] As nutrients of the medium, those usually used for culture of microorganisms may be widely used. As carbon
sources, any assimilable carbon compounds may be used, and examples include glucose,
sucrose, lactose, maltose, molasses, pyruvic acid and so forth. Furthermore, as nitrogen sources, any utilizable
nitrogen compounds may be used, and examples include peptone, meat extracts, yeast
extracts, casein hydrolysates, soybean cake alkaline extracts and so forth. In addition, phosphate, carbonate, sulfate,
salts of magnesium, calcium, potassium, iron, manganese, zinc and so forth, particular amino acids, particular vitamins
and so forth are used as required.

[0030] Although the culture temperature can be appropriately changed within a range in which the bacterium grows and
produces the protein of the present invention, it is preferably about 20°C to 42°C. The culture time varies somewhat
depending on the conditions. However, the culture may be completed at an appropriate time estimated to give the
maximum GDH yield, and the culture time is usually about 12 to 72 hours. Although the pH of the medium may be
appropriately changed within a range in which the bacterium grows and produces the protein of the present invention,
it is preferably in the range of about pH 6.0 to 9.0.

[0031] The culture medium containing cells of the microorganism producing the protein of the present invention
can be collected and utilized as it is. However, when the protein of the present invention exists in the
culture medium, the culture medium is usually separated into a solution containing the protein of the present invention
and microorganism cells by filtration, centrifugation or the like in a conventional manner and then used. When the
protein of the present invention exists in the cells, the cells are collected from the obtained culture by means of filtration,
centrifugation or the like, and then disrupted by a mechanical method or an enzymatic method such as use of lysozyme.
Further, a chelating agent such as EDTA and a surfactant are added to the cells to solubilize the protein of the present
invention, as required, followed by isolation and collection as an aqueous solution.

[0032] Protein may be precipitated from the thus-obtained protein-containing solution by, for example, vacuum concentration,
membrane concentration, salting out with ammonium sulfate, sodium sulfate or the like, or fractional
precipitation with a hydrophilic organic solvent such as methanol, ethanol, or acetone. Further, heat treatment and
isoelectric point treatment are also effective purification means. Then, purification can be performed by a suitable
combination of gel filtration using an adsorbent, gel filtration agent, etc., adsorption chromatography, ion exchange
chromatography and affinity chromatography to obtain a purified protein of the present invention.
[0033] A purified enzyme preparation can be obtained by isolation and purification based on column chromatography.
Although the purified enzyme preparation is preferably purified to such an extent that a single band is obtained in
electrophoresis (SDS-PAGE), it may contain an α-subunit or a γ-subunit.

[0034] The thus-obtained purified enzyme can be powdered by, for example, lyophilization, vacuum drying, spray 
drying or the like and distributed. 

[0035] The GDH complex consisting of a β-subunit, an α-subunit and, whenever required, a γ-subunit of the present
invention, or transformants containing those, may be used for an enzyme electrode of a glucose sensor. As the electrode,
a carbon electrode, gold electrode, platinum electrode or the like may be used, and the GDH of the present
invention is immobilized on the electrode. Examples of the method for immobilization include a method of using a
crosslinking reagent, a method of entrapping the enzyme in a polymer matrix, a method of covering the enzyme with
a dialysis membrane, and methods of using a photo-crosslinking polymer, conductive polymer, oxidation-reduction polymer
or the like. Alternatively, the enzyme may be immobilized in a polymer or immobilized on an electrode by adsorption
together with an electron mediator, typical examples of which are ferrocene and derivatives thereof, or these methods
may be used in combination. Typically, the glucose dehydrogenase of the present invention is immobilized on a carbon
electrode by using glutaraldehyde, and then glutaraldehyde is blocked by treatment with a reagent having an amine
group.

[0036] The glucose concentration can be measured as follows. A buffer is placed in a thermostatic cell, and a mediator 
is added thereto. Then, a constant temperature is maintained. As the mediator, potassium ferricyanide, phenazine 
methosulfate and so forth may be used. An electrode on which the enzyme of the present invention is immobilized is 
used as a working electrode, and a counter electrode (e.g., platinum electrode) and a reference electrode (e.g.,
Ag/AgCl electrode) are used. After a constant voltage is applied to the carbon electrode to obtain a steady-state current,
a sample containing glucose is added thereto and the increase of the current is measured. The glucose concentration 
in the sample can be calculated according to a calibration curve produced by using glucose solutions having standard 
concentrations. 
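
The calibration step described above can be sketched as a straight-line fit followed by inversion. This is a minimal illustration that assumes the current increase is linear in glucose concentration over the range of the standards, which may not hold at high concentrations. The function names, units, and standard values are hypothetical.

```python
def fit_calibration(concentrations_mM, current_increase_uA):
    """Least-squares line through the standards; returns (slope, intercept)."""
    n = len(concentrations_mM)
    if n != len(current_increase_uA) or n < 2:
        raise ValueError("need at least two paired standards")
    mean_c = sum(concentrations_mM) / n
    mean_i = sum(current_increase_uA) / n
    sxx = sum((c - mean_c) ** 2 for c in concentrations_mM)
    if sxx == 0:
        raise ValueError("standards must span more than one concentration")
    sxy = sum((c - mean_c) * (i - mean_i)
              for c, i in zip(concentrations_mM, current_increase_uA))
    slope = sxy / sxx
    return slope, mean_i - slope * mean_c


def glucose_from_current(current_uA, slope, intercept):
    """Invert the calibration line to estimate the glucose concentration."""
    if slope == 0:
        raise ValueError("calibration slope is zero")
    return (current_uA - intercept) / slope


# Hypothetical standards: glucose in mM against current increase in microamperes.
standards_mM = [0.0, 1.0, 2.0, 4.0]
responses_uA = [0.0, 0.5, 1.0, 2.0]
slope, intercept = fit_calibration(standards_mM, responses_uA)
sample_mM = glucose_from_current(1.5, slope, intercept)
print(f"estimated glucose: {sample_mM:.2f} mM")
```

Samples whose current falls outside the range covered by the standards would need dilution or a wider calibration rather than extrapolation.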

[0037] The GDH complex containing the β subunit of the present invention may be used as a component in an assay
kit for saccharides such as glucose. Typically, the kit includes, in addition to the GDH complex, a buffer necessary for
the assay, a mediator, a standard solution of, for example, glucose for preparing a calibration curve, and a manual on
the use of the kit. The enzyme according to the present invention may be provided in various forms, for example, as 
a freeze-dried reagent or as a solution in a suitable stock solution. 

Example 

[0038] Hereinafter, the present invention will be described specifically with reference to examples. 
Reference Example 1: Isolation of a gene encoding the GDH α subunit of Burkholderia cepacia KS1 strain

<1> Preparation of chromosomal DNA from Burkholderia cepacia KS1 strain

[0039] Chromosomal DNA was prepared from the Burkholderia cepacia KS1 strain in a conventional manner. That
is, the bacterial strain was cultured with shaking overnight at 34°C in TL liquid medium (10 g of polypeptone, 1 g of yeast
extract, 5 g of NaCl, 2 g of KH2PO4, and 5 g of glucose in 1 L, pH 7.2). The grown cells were collected by centrifugation.
The cells were suspended in a solution containing 10 mM NaCl, 20 mM Tris-HCl (pH 8.0), 1 mM EDTA, 0.5% SDS,
and 100 µg/ml proteinase K and treated at 54°C for 6 hours. An equal volume of phenol-chloroform
was added to this mixture, which was stirred at room temperature for 10 minutes, and then the supernatant was
collected by centrifugation. Sodium acetate was added to the supernatant to give a final concentration of 0.3 M, and
the mixture was overlaid with a two-fold volume
of ethanol to precipitate chromosomal DNA in the intermediate layer. The DNA was taken up with a glass rod, washed
with 70% ethanol, and dissolved in an appropriate amount of TE buffer to obtain a chromosomal DNA solution.

<2> Determination of N-terminus amino acid sequence of GDH α-subunit

[0040] GDH purified in the same manner as in Example 2 was concentrated by lyophilization and separated by SDS-electrophoresis
using 12.5% polyacrylamide to isolate the α-subunit. The thus-obtained α-subunit was transferred onto
a polyvinylidene fluoride membrane, and then the N-terminus amino acid sequence was determined by using an amino
acid sequencer (PPSQ-10, manufactured by Shimadzu Corporation). As a result, it was found that the enzyme contained
a peptide sequence consisting of 11 residues, amino acid numbers 2 to 12 in the amino acid sequence of
SEQ ID NO: 3.
<3> Cloning of a gene encoding α-subunit

[0041] The DNA prepared in <1> (1 µg) was subjected to partial digestion with the restriction enzyme Sau3AI, followed
by treatment with calf intestinal alkaline phosphatase (CIAP). Separately, SuperCosI (obtained from Stratagene), which
is a cosmid, was treated with BamHI, and the DNA fragment obtained by the partial digestion of the chromosomal DNA
fragment derived from the α-15 strain with Sau3AI was introduced into SuperCosI by using T4 DNA ligase. E. coli XL-1
Blue MR (obtained from Stratagene) was transformed with the obtained recombinant DNA. A transformant was
selected on an LB agar medium containing 10 µg/ml neomycin and 25 µg/ml ampicillin based on neomycin resistance
and ampicillin resistance, which are the antibiotic resistances on SuperCosI. The obtained transformant was cultured in
LB liquid medium. These transformant cells were collected and suspended in a reagent for measuring the GDH
activity, and a clone was selected by using dehydrogenase activity for glucose as an index. As a result, one clone
showing the glucose dehydrogenase activity was obtained.

<4> Subcloning 

[0042] DNA fragments containing the target gene were prepared from the cosmid, SuperCosI, containing the gene
encoding the α-subunit obtained in <3>. The inserted gene fragments were cleaved from the cosmid by using the restriction
enzyme NotI. These DNA fragments were treated with the restriction enzyme XbaI and introduced into the plasmid
pUC18 digested with XbaI. The E. coli DH5α-MCR strain was transformed with the plasmid pUC18 containing each
inserted fragment, and colonies that appeared on an LB agar medium containing 50 µg/ml of ampicillin were collected. The
obtained transformants were cultured in the liquid LB medium, followed by examination for the GDH activity in the cells
in the same manner as in <3>. As a result, a transformant showing the GDH activity was obtained. The plasmid was
extracted from this transformant, and the inserted DNA fragment was analyzed. As a result, an insert of about 8.8 kbp
was confirmed. This plasmid was designated as pKS1.

<5> Determination of the nucleotide sequence 

[0043] Restriction enzyme analysis of the inserted DNA fragment in pKS1 was performed, and the nucleotide sequence
of the fragment was determined according to a conventional method. As a result, the sequence of the DNA
encoding the N-terminus amino acid sequence of the α-subunit found in <2> was confirmed in this inserted DNA
fragment, and an open reading frame containing this sequence was found. The determined nucleotide sequence and
the amino acid sequence that can be encoded by this nucleotide sequence are as shown in SEQ ID NOs: 1 and 3. In
the nucleotide sequence of SEQ ID NO: 1, the nucleotide sequence downstream from nucleotide number 2,386 encodes
the amino acid sequence of SEQ ID NO: 4, which corresponds to the β-subunit.

Reference Example 2: Production of a GDH α-subunit by recombinant E. coli

[0044] Since the nucleotide sequence of the α-subunit was determined, a vector was prepared using the aforementioned
structural gene of the α-subunit, and a transformant was further produced using this vector.
[0045] First, a gene to be inserted into the vector was prepared as follows.

[0046] Amplification was performed by PCR using a genome fragment derived from the KS1 strain as a template so 
that a desired restriction enzyme site is included. The following pair of oligonucleotide primers were used in PCR. 
(Forward)

5'-CCCAAGCTTGGGCCGATACCGATACGCA-3' (SEQ ID NO: 5)

(Reverse)

5'-GAGAAGCTTTCCGCACGGTCAGACTTCC-3' (SEQ ID NO: 6)

[0047] The gene amplified by PCR was digested with the restriction enzyme HindIII and inserted into the expression
vector pFLAG-CTS (SIGMA) at its cloning site, the HindIII site. The obtained plasmid was designated as pFLAG-CTS/α.
[0048] The E. coli DH5αMCR strain was transformed with the aforementioned plasmid pFLAG-CTS/α, and colonies
that appeared on the LB agar medium containing 50 µg/ml of ampicillin were collected.

[0049] Further, when the open reading frame of the pKS1 inserted fragment was searched upstream of the α-subunit,
a structural gene consisting of 507 nucleotides encoding a polypeptide including 168 amino acid residues
shown in SEQ ID NO: 2 (nucleotide numbers 258 to 761 in SEQ ID NO: 1) was newly found. This structural gene was
considered to encode the γ-subunit.

[0050] Since it was found that the region encoding the γ-subunit existed upstream of the coding region of the α-subunit,
a recombinant vector containing a gene having a polycistronic structure continuously including the γ-subunit
and the α-subunit was produced, and a transformant into which this vector was introduced was constructed.
[0051] First, a gene to be inserted into the vector was prepared as follows. 

[0052] Amplification was performed by PCR using a genome fragment derived from the KS1 strain continuously 
including the structural gene of the y-subunit and the structural gene of the a-subunit as a template so that a desired 
restriction enzyme site is included. The following pair of oligonucleotide primers were used for PCR. 
(Forward)

5'-CATGCCATGGCACACAACGACAACACT-3' (SEQ ID NO: 7)

(Reverse)

5'-CCCAAGCTTGGGTCAGACTTCCTTCTTCAGC-3' (SEQ ID NO: 8)

[0053] The 5'-terminus and the 3'-terminus of the gene amplified by PCR were digested with NcoI and HindIII, respectively, and the gene was inserted into the vector pTrc99A (Pharmacia) at its cloning site, the NcoI/HindIII site. The obtained plasmid was named pTrc99A/γ+α.
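Whether each primer carries the restriction site used for cloning can be checked with a simple string search. The sketch below is illustrative only: the primer sequences are those of SEQ ID NOs: 5 to 8, the recognition sequences are the standard ones for HindIII (AAGCTT) and NcoI (CCATGG), and the function name `site_position` is hypothetical.

```python
# Illustrative check that each primer carries the intended recognition site.
# Primer sequences are those of SEQ ID NOs: 5 to 8; the recognition sequences
# (HindIII: AAGCTT, NcoI: CCATGG) are the standard ones for these enzymes.

RECOGNITION_SITES = {"HindIII": "AAGCTT", "NcoI": "CCATGG"}

PRIMERS = {
    "SEQ ID NO: 5": ("CCCAAGCTTGGGCCGATACCGATACGCA", "HindIII"),
    "SEQ ID NO: 6": ("GAGAAGCTTTCCGCACGGTCAGACTTCC", "HindIII"),
    "SEQ ID NO: 7": ("CATGCCATGGCACACAACGACAACACT", "NcoI"),
    "SEQ ID NO: 8": ("CCCAAGCTTGGGTCAGACTTCCTTCTTCAGC", "HindIII"),
}

def site_position(primer: str, enzyme: str) -> int:
    """Return the 0-based index of the enzyme's site in the primer, or -1."""
    return primer.upper().find(RECOGNITION_SITES[enzyme])

for name, (sequence, enzyme) in PRIMERS.items():
    print(name, enzyme, site_position(sequence, enzyme))
```

A result of -1 would indicate that the expected site is absent from the primer as transcribed.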

[0054] The E. coli DH5αMCR strain was transformed with the aforementioned plasmid pTrc99A/γ+α, and colonies that appeared on LB agar medium containing 50 µg/ml of ampicillin were collected.
[0055] The α-subunit was produced using the E. coli DH5αMCR strain transformed with each of the aforementioned plasmids pKS1, pFLAG-CTS/α and pTrc99A/γ+α. Each transformant was inoculated into 3 ml of LB medium containing 50 µg/ml of ampicillin and cultured at 37°C for 12 hours, and the cells were collected by centrifugation. The cells were disrupted using a French press (1,500 kgf), and a membrane fraction (10 mM potassium phosphate buffer, pH 6.0) was isolated by ultracentrifugation (4°C, 160,400 x g, 90 minutes).


Reference Example 3: Confirmation of GDH activity 

[0056] First, the GDH activity in each of the aforementioned membrane fractions was confirmed. Specifically, visual determination was performed using 10 mM potassium phosphate buffer (pH 7.0) containing 594 µM of methylphenazine methosulfate (mPMS) and 5.94 µM of 2,6-dichlorophenolindophenol (DCIP). The results are shown below. The number of + signs represents the degree of color change from blue to colorless.

Membrane fraction of cultured transformant transformed with pFLAG-CTS/α: +
Membrane fraction of cultured transformant transformed with pKS1: ++
Membrane fraction of cultured transformant transformed with pTrc99A/γ+α: +++

[0057] The GDH activity of the membrane fraction of the cultured transformant transformed with pFLAG-CTS/α, which contains only the α-subunit, was the lowest, and the GDH activity of the membrane fraction of the cultured transformant transformed with pTrc99A/γ+α, with which a vector was efficiently constructed, was the highest.
[0058] Although the α-subunit was expressed even in the transformant transformed with a vector using only the structural gene of the α-subunit, the α-subunit could be obtained efficiently by using a vector containing the structural gene of the γ-subunit and the structural gene of the α-subunit in combination.

[0059] Glucose was assayed using the glucose dehydrogenase of the present invention. The enzymatic activity of the glucose dehydrogenase (α-subunit) of the present invention was measured at various glucose concentrations. The GDH activity was measured in 10 mM potassium phosphate buffer (pH 7.0) containing 594 µM of methylphenazine methosulfate (mPMS) and 5.94 µM of 2,6-dichlorophenolindophenol (DCIP). An enzyme sample and glucose serving as a substrate were added thereto, followed by incubation at 37°C, and the change in the absorbance of DCIP at 600 nm was monitored using a spectrophotometer. The rate of decrease in the absorbance was taken as the enzymatic reaction rate. Glucose could be quantified in the range of 0.01 to 1.0 mM using the GDH of the present invention.
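The rate of decrease in absorbance described above can be estimated as the negated slope of a straight line fitted to the absorbance readings over time. The following is a minimal sketch under stated assumptions: the time points and absorbance values are made-up example data, not measurements from this document, and the function name `least_squares_slope` is hypothetical.

```python
# Illustrative sketch: estimate the reaction rate as the slope of A600 vs. time.
# The time points and absorbance readings below are made-up example values.

def least_squares_slope(times: list[float], values: list[float]) -> float:
    """Slope of the ordinary least-squares line through (times, values)."""
    n = len(times)
    if n < 2 or n != len(values):
        raise ValueError("need at least two paired readings")
    mean_t = sum(times) / n
    mean_v = sum(values) / n
    sxx = sum((t - mean_t) ** 2 for t in times)
    sxy = sum((t - mean_t) * (v - mean_v) for t, v in zip(times, values))
    return sxy / sxx

# Hypothetical readings: absorbance of DCIP at 600 nm, sampled every 10 s.
times_s = [0.0, 10.0, 20.0, 30.0, 40.0]
a600 = [0.100, 0.095, 0.090, 0.085, 0.080]

# The absorbance falls as DCIP is reduced, so the rate is the negated slope.
rate_per_s = -least_squares_slope(times_s, a600)
print(f"rate of decrease: {rate_per_s:.6f} absorbance units per second")
```

Repeating such a fit at several glucose concentrations would give the rate-versus-concentration relationship on which quantification relies.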
Example 1: Isolation of gene encoding the GDH β-subunit of Burkholderia cepacia KS1 strain
<1> Searching for the Burkholderia cepacia KS1 strain GDH β subunit

[0060] The GDH β subunit gene derived from the KS1 strain was searched for using the Burkholderia cepacia J2315 strain genome database of the Sanger Centre (http://www.sanger.ac.uk/). Referring to a known N-terminal sequence of the KS1 strain GDH β subunit (SEQ ID NO: 9), there was designed an amino acid sequence (SEQ ID NO: 10) that has high homology to each cytochrome c subunit of an alcohol dehydrogenase derived from Acetobacter sp. or Gluconobacter sp. (Tamaki T. et al., Biochim Biophys Acta 1088 (2): 292-300 (1991), Matsushita K., et al., Biosci. Biotech. Biochem., 56, 304-310 (1992), Takemura H., et al., J Bacteriol, 175, 6857-66 (1993), Kondo K. et al., Appl Environ Microbiol, 63, 1131-8 (1997)); a gluconate dehydrogenase derived from Erwinia sp. or Pseudomonas sp. (Yum DY, et al., J Bacteriol, 179, 6566-72 (1997), Matsushita K. et al., J Biochem, 85: 1173-81 (1979)); a sorbitol dehydrogenase derived from Gluconobacter sp. (Choi, E.S., et al., FEMS Microbiol. Lett., 125, 45-50 (1995)); and a 2-ketogluconate dehydrogenase derived from Erwinia sp. or Pantoea sp. (Pujol CJ et al., J Bacteriol, 182, 2230-7, (2000)).

[0061] Based on the aforementioned amino acid sequence, gene sequences encoding amino acid sequences with high homology were searched for in the aforementioned database of the Burkholderia cepacia J2315 strain by BLAST. Then, the five sequences obtained were searched for homology with the C-terminal sequence of the GDH α subunit of the KS1 strain. As a result, amino acid sequences translated from two gene fragments showed high homologies (>90%). Each gene fragment was as short as 200 to 500 bp, so sequences having high homologies with these sequences were searched for in the genome database of the Burkholderia cepacia J2315 strain by BLAST and the fragments were joined to each other. As a result, a fragment of 3,110 bp was obtained. In the obtained nucleotide sequence, there existed an ORF that is presumed to be the C-terminus of the GDH and an ORF of 1,275 bp that is presumed to be a cytochrome C structural gene (SEQ ID NO: 11). The amino acid sequence encoded by the ORF is shown in SEQ ID NO: 12. Comparison between the obtained nucleotide sequence of the J2315 strain and the nucleotide sequence of the α subunit of the KS1 strain that has already been cloned indicates that, downstream of the α subunit, there is a nucleotide sequence having high homology with the nucleotide sequence encoding the signal peptide of cytochrome C of the J2315 strain.

[0062] From the above, the third ORF in the cloned fragment of the Burkholderia cepacia KS1 strain obtained in Reference Example 1 (nucleotides 2386 et seq. of SEQ ID NO: 1) is presumed to encode the β subunit. The amino acid sequence at the N-terminus of the purified β subunit corresponds to the 5 amino acid residues translated from the nucleotide sequence consisting of nucleotides 2452 to 2466 in SEQ ID NO: 1, which also suggests that the aforementioned ORF encodes the β subunit.

<2> Amplification of β subunit structural gene using inverse PCR method

(1) Culture of microorganism cells and extraction of genome

[0063] Using 5 ml of complete medium (0.5% polypepton, 0.3% yeast extract, 0.5% NaCl), the KS1 strain was cultured with shaking at 37°C overnight. The genome was extracted from the obtained microorganism cells using the GenomicPrep™ Cells and Tissue DNA Isolation Kit (Amersham Pharmacia Biotech) in accordance with the attached manual. The obtained genome was subjected to phenol/chloroform treatment, precipitated with ethanol, and then dissolved in purified water.

(2) Cyclization of genome fragment

[0064] The genome extracted from the KS1 strain was digested with BamHI, EcoRI, HindIII, SmaI, SacI, and XhoI, and the genome fragments were recovered by precipitation with ethanol. Then, 1 µg of the genome digested with the restriction enzymes was subjected to a ligation reaction using a DNA ligation kit (Takara Shuzo Co., Ltd.) at 16°C overnight.

(3) PCR

[0065] 50 pmol of forward primer (EF1, SEQ ID NO: 13) designed based on the nucleotide sequence in the N-terminal signal sequence region of the GDH β subunit of the KS1 strain, 50 pmol of reverse primer (ER1, SEQ ID NO: 14) (all the primers were synthesized by Invitrogen on consignment), 0.5 µl of LA Taq (Takara Bio Co., Ltd.), 8 µl of dNTP solution, and 5 µl of 10x PCR buffer were added to purified water to a total volume of 50 µl, and PCR was performed using a program temp control system PC-801 (ASTEC). The PCR was performed under the following conditions: 94°C for 5 minutes; 30 cycles of 98°C for 20 seconds, 62°C for 30 seconds, and 72°C for 6 minutes; and finally 72°C for 10 minutes.
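The total programmed hold time of the cycling conditions can be worked out with a short calculation. The sketch below is illustrative and assumes that the program consists of an initial 94°C step, 30 cycles of the three steps at 98°C, 62°C and 72°C, and a final 72°C step; ramp times between temperatures are ignored, and all names in the code are hypothetical.

```python
# Illustrative sketch: total programmed hold time of the thermal cycling
# program, assuming an initial 94 C step, 30 three-step cycles and a final
# 72 C extension. Ramp times between temperatures are ignored.

INITIAL_DENATURATION_S = 5 * 60
CYCLE_STEPS_S = {"98 C": 20, "62 C": 30, "72 C": 6 * 60}
CYCLES = 30
FINAL_EXTENSION_S = 10 * 60

def total_program_seconds() -> int:
    """Sum the hold times of all steps in the assumed program."""
    per_cycle = sum(CYCLE_STEPS_S.values())
    return INITIAL_DENATURATION_S + CYCLES * per_cycle + FINAL_EXTENSION_S

total_s = total_program_seconds()
print(f"{total_s} s = {total_s / 60:.0f} min")
```

Under these assumptions the hold times add up to 13,200 seconds, that is, 220 minutes.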

[0066] When the genome digested with a restriction enzyme (SmaI) was used as a template, a fragment of about 2.1 kbp was confirmed by agarose gel electrophoresis.

<3> Sequencing of PCR-amplified fragment

(1) TA cloning

[0067] After the aforementioned inverse PCR product was electrophoresed on an agarose gel, the band was cut out and purified using Gene clean II KIT (Bio101 inc.). The fragment was ligated to the pGEM-T Vector using pGEM®-T and pGEM®-T Easy Vector Systems (Promega). E. coli DH5α was transformed with the ligated vector, and the transformant was cultured overnight on an L-agar medium containing 50 µg/ml ampicillin, 40 µg/ml X-Gal, and 0.1 µM IPTG. From the colonies that appeared, white colonies were selected and cultured overnight in an L medium containing 50 µg/ml ampicillin, followed by extraction of plasmids from the cells by the alkali method.

(2) Preparation of sequence sample

[0068] The obtained plasmid was treated with RNase, and 0.6 volume of 20% PEG6000/2.5 M NaCl was added thereto. The mixture was left to stand on ice for 1 hour. Thereafter, the mixture was centrifuged at 15,000 rpm and 4°C for 15 minutes to obtain a pellet. The pellet was washed with 70% ethanol and dried in vacuum. The dried product was dissolved in purified water.
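The final PEG and NaCl concentrations after adding 0.6 volume of the stock to one volume of plasmid solution follow from a simple dilution calculation. This is a minimal sketch that assumes the plasmid solution itself contributes no PEG or NaCl; the function name `final_concentration` is hypothetical.

```python
# Illustrative dilution calculation for the PEG/NaCl precipitation step.
# Assumes the plasmid solution contributes no PEG or NaCl of its own.

def final_concentration(stock: float, added_volumes: float,
                        sample_volumes: float = 1.0) -> float:
    """Concentration after mixing `added_volumes` of stock with the sample."""
    if added_volumes < 0 or sample_volumes <= 0:
        raise ValueError("volumes must be positive")
    return stock * added_volumes / (sample_volumes + added_volumes)

peg_percent = final_concentration(20.0, 0.6)  # % PEG6000 in the mixture
nacl_molar = final_concentration(2.5, 0.6)    # M NaCl in the mixture

print(f"PEG6000: {peg_percent:.2f} %, NaCl: {nacl_molar:.4f} M")
```

With these inputs the mixture would contain about 7.5% PEG6000 and about 0.94 M NaCl.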

(3) Analysis of nucleotide sequence of DNA 

[0069] The nucleotide sequence of the inserted fragment of the plasmid obtained in (2) was analyzed using an ABI PRISM™ 310 Genetic Analyzer (PERKIN-ELMER Applied Biosystems). A portion of the sequence of the inserted fragment was determined from the multicloning site of the vector using the M13 primer. As a result, the nucleotide sequence containing the N-terminus of the β subunit that had already been analyzed was confirmed. Based on this sequence, primers were sequentially prepared and used to determine the nucleotide sequence of the inserted fragment. The result is shown in SEQ ID NO: 15. Further, the amino acid sequence encoded by the ORF in the nucleotide sequence is shown in SEQ ID NO: 16.

[0070] The β subunit has 425 amino acid residues in total, and from the comparison with the N-terminal amino acid sequence already obtained, 22 residues among them are considered to be a signal peptide. The molecular weight of the β subunit calculated from the amino acid sequence is 45,276 Da, and the molecular weight of the portion excluding the signal peptide, 42,731 Da, is substantially identical to the molecular weight of 43 kDa of the GDH β subunit of the KS1 strain. In the amino acid sequence of the β subunit, motifs (SEQ ID NO: 18) for linking with heme in cytochrome C were confirmed at 3 positions. The ORF was located immediately downstream of the ORF of the structural gene of the α subunit, and a sequence that is presumed to be an SD sequence existed upstream of the initiation codon.
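Candidate heme-linking sites can be located in a protein sequence with a pattern search. The sketch below assumes that the motif follows the canonical c-type cytochrome pattern Cys-X-X-Cys-His; this is an assumption for illustration, since SEQ ID NO: 18 itself is not reproduced here. The test fragments are short stretches transcribed from SEQ ID NO: 12 (J2315 strain) in one-letter code, and the function name `find_heme_motifs` is hypothetical.

```python
import re

# Illustrative sketch: locate candidate heme-binding sites, assuming the
# canonical c-type cytochrome pattern Cys-X-X-Cys-His (C..CH in one-letter code).
HEME_MOTIF = re.compile(r"(?=(C..CH))")  # lookahead so overlapping hits are found

def find_heme_motifs(protein: str) -> list[tuple[int, str]]:
    """Return (0-based start, matched text) for every C-X-X-C-H occurrence."""
    return [(m.start(), m.group(1)) for m in HEME_MOTIF.finditer(protein.upper())]

# Short fragments transcribed from SEQ ID NO: 12 (J2315 strain), one-letter code.
fragments = ["AGDCMACHTAK", "LAHCSTCHTPR", "GNCATCHQMQ"]

for fragment in fragments:
    print(fragment, find_heme_motifs(fragment))
```

Each of the three fragments contains one occurrence of the pattern, in line with the three positions reported for the β subunit.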

[0071] A homology search for the obtained amino acid sequence by BLAST showed high homologies overall: a 65% homology with a cytochrome C subunit of oxidoreductase dehydrogenase derived from Ralstonia solanacearum, a 48% homology with a cytochrome C subunit of sorbitol dehydrogenase derived from Gluconobacter oxydans, a 44% homology with a cytochrome C subunit of gluconic acid dehydrogenase derived from Erwinia cypripedii, and a 46.4% homology on an amino acid level with a cytochrome C subunit of 2-keto-gluconic acid dehydrogenase derived from Pantoea citrea. Furthermore, the amino acid sequences of these cytochromes C conserved a heme-linking motif (SEQ ID NO: 18).

[0072] The structural gene of the GDH β subunit of the KS1 strain has a homology of 92.0% on the nucleotide sequence level and of 92.2% on the amino acid level with the structural gene of the GDH β subunit of the J2315 strain.
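A percent identity of this kind is the fraction of matching positions between two aligned sequences. The following minimal sketch applies that definition to the 16-residue N-terminal sequence of the KS1 β subunit (SEQ ID NO: 9) and the corresponding stretch of the J2315 sequence (residues 23 to 38 of SEQ ID NO: 12), assuming an ungapped alignment. It illustrates the calculation only and does not reproduce the full-length figures above; the function name `percent_identity` is hypothetical.

```python
# Illustrative sketch: percent identity over an ungapped alignment.

def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percentage of positions at which two equal-length sequences agree."""
    if len(seq_a) != len(seq_b) or not seq_a:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(a == b for a, b in zip(seq_a, seq_b))
    return 100.0 * matches / len(seq_a)

ks1_n_terminus = "ADAADPALVKRGEYLA"  # SEQ ID NO: 9, one-letter code
j2315_segment = "ADSADPAHVKRGEYLA"   # residues 23 to 38 of SEQ ID NO: 12

identity = percent_identity(ks1_n_terminus, j2315_segment)
print(f"{identity:.1f} % identity over {len(ks1_n_terminus)} residues")
```

For this short stretch, 14 of the 16 positions match.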

Industrial Applicability

[0073] The present invention provides the GDH β subunit of a microorganism belonging to the genus Burkholderia and the DNA encoding it.
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SEQUENCE LISTING 

<110> SODE, Koji 

<120> Glucose dehydrogenase beta-subunit and DNA encoding the same

<130> G780-OP1551 

<141> 2003-04-25 

<150> JP 2002-125353 
<151> 2002-04-26 

<160> 19 

<170> PatentIn Ver. 2.0

<210> 1 

<211> 2467 

<212> DNA 

<213> Burkholderia cepacia

<220> 

<221> CDS 

<222> (258).. (761) 

<220> 
<221> CDS 

<222> (764).. (2380) 

<220> 
<221> CDS 

<222> (2386).. (2466) 
<400> 1 

aagctttctg tttgattgca cgcgattcta accgagcgtc tgtgaggcgg aacgcgacat 60
gcttcgtgtc gcacacgtgt cgcgccgacg acacaaaaat gcagcgaaat ggctgatcgt 120
tacgaatggc tgacacattg aatggactat aaaaccattg tccgttccgg aatgtgcgcg 180
tacatttcag gtccgcgccg atttttgaga aatatcaagc gtggttttcc cgaatccggt 240
gttcgagaga aggaaac atg cac aac gac aac act ccc cac tcg cgt cgc 290
Met His Asn Asp Asn Thr Pro His Ser Arg Arg
1 5 10

cac ggc gac gca gcc gca tca ggc atc acg cgg cgt caa tgg ttg caa 338
His Gly Asp Ala Ala Ala Ser Gly Ile Thr Arg Arg Gln Trp Leu Gln
15 20 25

ggc gcg ctg gcg ctg acc gca gcg ggc ctc acg ggt tcg ctg aca ttg 386
Gly Ala Leu Ala Leu Thr Ala Ala Gly Leu Thr Gly Ser Leu Thr Leu
30 35 40

cgg gcg ctt gca gac aac ccc ggc act gcg ccg ctc gat acg ttc atg 434
Arg Ala Leu Ala Asp Asn Pro Gly Thr Ala Pro Leu Asp Thr Phe Met
45 50 55

acg ctt tcc gaa tcg ctg acc ggc aag aaa ggg ctc agc cgc gtg atc 482
Thr Leu Ser Glu Ser Leu Thr Gly Lys Lys Gly Leu Ser Arg Val Ile
60 65 70 75

ggc gag cgc ctg ctg cag gcg ctg cag aag ggc tcg ttc aag acg gcc 530
Gly Glu Arg Leu Leu Gln Ala Leu Gln Lys Gly Ser Phe Lys Thr Ala
80 85 90

gac agc ctg ccg cag ctc gcc ggc gcg ctc gcg tcc ggt tcg ctg acg 578
Asp Ser Leu Pro Gln Leu Ala Gly Ala Leu Ala Ser Gly Ser Leu Thr
95 100 105

cct gaa cag gaa tcg ctc gca ctg acg atc ctc gag gcc tgg tat ctc 626
Pro Glu Gln Glu Ser Leu Ala Leu Thr Ile Leu Glu Ala Trp Tyr Leu
110 115 120

ggc atc gtc gac aac gtc gtg att acg tac gag gaa gca tta atg ttc 674
Gly Ile Val Asp Asn Val Val Ile Thr Tyr Glu Glu Ala Leu Met Phe
125 130 135

ggc gtc gtg tcc gat acg ctc gtg atc cgt tcg tat tgc ccc aac aaa 722
Gly Val Val Ser Asp Thr Leu Val Ile Arg Ser Tyr Cys Pro Asn Lys
140 145 150 155

ccc ggc ttc tgg gcc gac aaa ccg atc gag agg caa gcc tg atg gcc 769
Pro Gly Phe Trp Ala Asp Lys Pro Ile Glu Arg Gln Ala Met Ala
160 165 170

gat acc gat acg caa aag gcc gac gtc gtc gtc gtt gga tcg ggt gtc 817
Asp Thr Asp Thr Gln Lys Ala Asp Val Val Val Val Gly Ser Gly Val
175 180 185

gcg ggc gcg atc gtc gcg cat cag ctc gcg atg gcg ggc aag gcg gtg 865
Ala Gly Ala Ile Val Ala His Gln Leu Ala Met Ala Gly Lys Ala Val
190 195 200

atc ctg ctc gaa gcg ggc ccg cgc atg ccg cgc tgg gaa atc gtc gag 913
Ile Leu Leu Glu Ala Gly Pro Arg Met Pro Arg Trp Glu Ile Val Glu
205 210 215

cgc ttc cgc aat cag ccc gac aag atg gac ttc atg gcg ccg tac ccg 961
Arg Phe Arg Asn Gln Pro Asp Lys Met Asp Phe Met Ala Pro Tyr Pro
220 225 230

tcg agc ccc tgg gcg ccg cat ccc gag tac ggc ccg ccg aac gac tac 1009
Ser Ser Pro Trp Ala Pro His Pro Glu Tyr Gly Pro Pro Asn Asp Tyr
235 240 245 250

ctg atc ctg aag ggc gag cac aag ttc aac tcg cag tac atc cgc gcg 1057
Leu Ile Leu Lys Gly Glu His Lys Phe Asn Ser Gln Tyr Ile Arg Ala
255 260 265

gtg ggc ggc acg acg tgg cac tgg gcc gcg tcg gcg tgg cgc ttc att 1105
Val Gly Gly Thr Thr Trp His Trp Ala Ala Ser Ala Trp Arg Phe Ile
270 275 280

ccg aac gac ttc aag atg aag agc gtg tac ggc gtc ggc cgc gac tgg 1153
Pro Asn Asp Phe Lys Met Lys Ser Val Tyr Gly Val Gly Arg Asp Trp
285 290 295

ccg atc cag tac gac gat ctc gag ccg tac tat cag cgc gcg gag gaa 1201
Pro Ile Gln Tyr Asp Asp Leu Glu Pro Tyr Tyr Gln Arg Ala Glu Glu
300 305 310

gag ctc ggc gtg tgg ggc ccg ggc ccc gag gaa gat ctg tac tcg ccg 1249
Glu Leu Gly Val Trp Gly Pro Gly Pro Glu Glu Asp Leu Tyr Ser Pro
315 320 325 330

cgc aag cag ccg tat ccg atg ccg ccg ctg ccg ttg tcg ttc aac gag 1297
Arg Lys Gln Pro Tyr Pro Met Pro Pro Leu Pro Leu Ser Phe Asn Glu
335 340 345

cag acc atc aag acg gcg ctg aac aac tac gat ccg aag ttc cat gtc 1345
Gln Thr Ile Lys Thr Ala Leu Asn Asn Tyr Asp Pro Lys Phe His Val
350 355 360

gtg acc gag ccg gtc gcg cgc aac agc cgc ccg tac gac ggc cgc ccg 1393
Val Thr Glu Pro Val Ala Arg Asn Ser Arg Pro Tyr Asp Gly Arg Pro
365 370 375

act tgt tgc ggc aac aac aac tgc atg ccg atc tgc ccg atc ggc gcg 1441
Thr Cys Cys Gly Asn Asn Asn Cys Met Pro Ile Cys Pro Ile Gly Ala
380 385 390

atg tac aac ggc atc gtg cac gtc gag aag gcc gaa cgc gcc ggc gcg 1489
Met Tyr Asn Gly Ile Val His Val Glu Lys Ala Glu Arg Ala Gly Ala
395 400 405 410

aag ctg atc gag aac gcg gtc gtc tac aag ctc gag acg ggc ccg gac 1537
Lys Leu Ile Glu Asn Ala Val Val Tyr Lys Leu Glu Thr Gly Pro Asp
415 420 425

aag cgc atc gtc gcg gcg ctc tac aag gac aag acg ggc gcc gag cat 1585
Lys Arg Ile Val Ala Ala Leu Tyr Lys Asp Lys Thr Gly Ala Glu His
430 435 440

cgc gtc gaa ggc aag tat ttc gtg ctc gcc gcg aac ggc atc gag acg 1633
Arg Val Glu Gly Lys Tyr Phe Val Leu Ala Ala Asn Gly Ile Glu Thr
445 450 455

ccg aag atc ctg ctg atg tcc gcg aac cgc gat ttc ccg aac ggt gtc 1681
Pro Lys Ile Leu Leu Met Ser Ala Asn Arg Asp Phe Pro Asn Gly Val
460 465 470

gcg aac agc tcg gac atg gtc ggc cgc aac ctg atg gac cat ccg ggc 1729
Ala Asn Ser Ser Asp Met Val Gly Arg Asn Leu Met Asp His Pro Gly
475 480 485 490

acc ggc gtg tcg ttc tat gcg agc gag aag ctg tgg ccg ggc cgc ggc 1777
Thr Gly Val Ser Phe Tyr Ala Ser Glu Lys Leu Trp Pro Gly Arg Gly
495 500 505

ccg cag ... atg acg tcg ctg atc ggt ttc cgc gac ggt ccg ttc cgc 1825
Pro Gln Glu Met Thr Ser Leu Ile Gly Phe Arg Asp Gly Pro Phe Arg
510 515 520

gcg acc gaa gcg gcg aag aag atc cac ctg tcg aac ctg tcg cgc atc 1873
Ala Thr Glu Ala Ala Lys Lys Ile His Leu Ser Asn Leu Ser Arg Ile
525 530 535

gac cag gag acg cag aag atc ttc aag gcc ggc aag ctg atg aag ccc 1921
Asp Gln Glu Thr Gln Lys Ile Phe Lys Ala Gly Lys Leu Met Lys Pro
540 545 550

gac gag ctc gac gcg cag atc cgc gac cgt tcc gca cgc tac gtg cag 1969
Asp Glu Leu Asp Ala Gln Ile Arg Asp Arg Ser Ala Arg Tyr Val Gln
555 560 565 570

ttc gac tgc ttc cac gaa atc ctg ccg caa ccc gag aac cgc atc gtg 2017
Phe Asp Cys Phe His Glu Ile Leu Pro Gln Pro Glu Asn Arg Ile Val
575 580 585

ccg agc aag acg gcg acc gat gcg atc ggc att ccg cgc ccc gag atc 2065
Pro Ser Lys Thr Ala Thr Asp Ala Ile Gly Ile Pro Arg Pro Glu Ile
590 595 600

acg tat gcg atc gac gac tac gtg aag cgc ggc gcc gcg cat acg cgc 2113
Thr Tyr Ala Ile Asp Asp Tyr Val Lys Arg Gly Ala Ala His Thr Arg
605 610 615

gag gtc tac gcg acc gcc gcg aag gtg ctc ggc ggc acg gac gtc gtg 2161
Glu Val Tyr Ala Thr Ala Ala Lys Val Leu Gly Gly Thr Asp Val Val
620 625 630

ttc aac gac gaa ttc gcg ccg aac aat cac atc acg ggc tcg acg atc 2209
Phe Asn Asp Glu Phe Ala Pro Asn Asn His Ile Thr Gly Ser Thr Ile
635 640 645 650

atg ggc gcc gat gcg cgc gac tcc gtc gtc gac aag gac tgc cgc acg 2257
Met Gly Ala Asp Ala Arg Asp Ser Val Val Asp Lys Asp Cys Arg Thr
655 660 665

ttc gac cat ccg aac ctg ttc att tcg agc agc gcg acg atg ccg acc 2305
Phe Asp His Pro Asn Leu Phe Ile Ser Ser Ser Ala Thr Met Pro Thr
670 675 680

gtc ggt acc gta aac gtg acg ctg acg atc gcc gcg ctc gcg ctg cgg 2353
Val Gly Thr Val Asn Val Thr Leu Thr Ile Ala Ala Leu Ala Leu Arg
685 690 695

atg tcg gac acg ctg aag aag gaa gtc tgacc gtg cgg aaa tct act ctc 2403
Met Ser Asp Thr Leu Lys Lys Glu Val Val Arg Lys Ser Thr Leu
700 705 710

act ttc ctc atc gcc ggc tgc ctc gcg ttg ccg ggc ttc gcg cgc gcg 2451
Thr Phe Leu Ile Ala Gly Cys Leu Ala Leu Pro Gly Phe Ala Arg Ala
715 720 725

gcc gat gcg gcc gat c 2467
Ala Asp Ala Ala Asp
730

<210> 2
<211> 168
<212> PRT
<213> Burkholderia cepacia

<400> 2

Met His Asn Asp Asn Thr Pro His Ser Arg Arg His Gly Asp Ala Ala
1 5 10 15

Ala Ser Gly Ile Thr Arg Arg Gln Trp Leu Gln Gly Ala Leu Ala Leu
20 25 30

Thr Ala Ala Gly Leu Thr Gly Ser Leu Thr Leu Arg Ala Leu Ala Asp
35 40 45

Asn Pro Gly Thr Ala Pro Leu Asp Thr Phe Met Thr Leu Ser Glu Ser
50 55 60

Leu Thr Gly Lys Lys Gly Leu Ser Arg Val Ile Gly Glu Arg Leu Leu
65 70 75 80

Gln Ala Leu Gln Lys Gly Ser Phe Lys Thr Ala Asp Ser Leu Pro Gln
85 90 95

Leu Ala Gly Ala Leu Ala Ser Gly Ser Leu Thr Pro Glu Gln Glu Ser
100 105 110

Leu Ala Leu Thr Ile Leu Glu Ala Trp Tyr Leu Gly Ile Val Asp Asn
115 120 125

Val Val Ile Thr Tyr Glu Glu Ala Leu Met Phe Gly Val Val Ser Asp
130 135 140

Thr Leu Val Ile Arg Ser Tyr Cys Pro Asn Lys Pro Gly Phe Trp Ala
145 150 155 160

Asp Lys Pro Ile Glu Arg Gln Ala
165

<210> 3
<211> 539
<212> PRT
<213> Burkholderia cepacia
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<400> 3 

Met Ala Asp Thr Asp Thr Gln Lys Ala Asp Val Val Val Val Gly Ser
1 5 10 15

Gly Val Ala Gly Ala Ile Val Ala His Gln Leu Ala Met Ala Gly Lys
20 25 30

Ala Val Ile Leu Leu Glu Ala Gly Pro Arg Met Pro Arg Trp Glu Ile
35 40 45

Val Glu Arg Phe Arg Asn Gln Pro Asp Lys Met Asp Phe Met Ala Pro
50 55 60

Tyr Pro Ser Ser Pro Trp Ala Pro His Pro Glu Tyr Gly Pro Pro Asn
65 70 75 80

Asp Tyr Leu Ile Leu Lys Gly Glu His Lys Phe Asn Ser Gln Tyr Ile
85 90 95

Arg Ala Val Gly Gly Thr Thr Trp His Trp Ala Ala Ser Ala Trp Arg
100 105 110

Phe Ile Pro Asn Asp Phe Lys Met Lys Ser Val Tyr Gly Val Gly Arg
115 120 125

Asp Trp Pro Ile Gln Tyr Asp Asp Leu Glu Pro Tyr Tyr Gln Arg Ala
130 135 140

Glu Glu Glu Leu Gly Val Trp Gly Pro Gly Pro Glu Glu Asp Leu Tyr
145 150 155 160

Ser Pro Arg Lys Gln Pro Tyr Pro Met Pro Pro Leu Pro Leu Ser Phe
165 170 175

Asn Glu Gln Thr Ile Lys Thr Ala Leu Asn Asn Tyr Asp Pro Lys Phe
180 185 190

His Val Val Thr Glu Pro Val Ala Arg Asn Ser Arg Pro Tyr Asp Gly
195 200 205

Arg Pro Thr Cys Cys Gly Asn Asn Asn Cys Met Pro Ile Cys Pro Ile
210 215 220

Gly Ala Met Tyr Asn Gly Ile Val His Val Glu Lys Ala Glu Arg Ala
225 230 235 240

Gly Ala Lys Leu Ile Glu Asn Ala Val Val Tyr Lys Leu Glu Thr Gly
245 250 255

Pro Asp Lys Arg Ile Val Ala Ala Leu Tyr Lys Asp Lys Thr Gly Ala
260 265 270

Glu His Arg Val Glu Gly Lys Tyr Phe Val Leu Ala Ala Asn Gly Ile
275 280 285

Glu Thr Pro Lys Ile Leu Leu Met Ser Ala Asn Arg Asp Phe Pro Asn
290 295 300

Gly Val Ala Asn Ser Ser Asp Met Val Gly Arg Asn Leu Met Asp His
305 310 315 320

Pro Gly Thr Gly Val Ser Phe Tyr Ala Ser Glu Lys Leu Trp Pro Gly
325 330 335

Arg Gly Pro Gln Glu Met Thr Ser Leu Ile Gly Phe Arg Asp Gly Pro
340 345 350

Phe Arg Ala Thr Glu Ala Ala Lys Lys Ile His Leu Ser Asn Leu Ser
355 360 365

Arg Ile Asp Gln Glu Thr Gln Lys Ile Phe Lys Ala Gly Lys Leu Met
370 375 380

Lys Pro Asp Glu Leu Asp Ala Gln Ile Arg Asp Arg Ser Ala Arg Tyr
385 390 395 400

Val Gln Phe Asp Cys Phe His Glu Ile Leu Pro Gln Pro Glu Asn Arg
405 410 415

Ile Val Pro Ser Lys Thr Ala Thr Asp Ala Ile Gly Ile Pro Arg Pro
420 425 430

Glu Ile Thr Tyr Ala Ile Asp Asp Tyr Val Lys Arg Gly Ala Ala His
435 440 445

Thr Arg Glu Val Tyr Ala Thr Ala Ala Lys Val Leu Gly Gly Thr Asp
450 455 460

Val Val Phe Asn Asp Glu Phe Ala Pro Asn Asn His Ile Thr Gly Ser
465 470 475 480

Thr Ile Met Gly Ala Asp Ala Arg Asp Ser Val Val Asp Lys Asp Cys
485 490 495

Arg Thr Phe Asp His Pro Asn Leu Phe Ile Ser Ser Ser Ala Thr Met
500 505 510

Pro Thr Val Gly Thr Val Asn Val Thr Leu Thr Ile Ala Ala Leu Ala
515 520 525

Leu Arg Met Ser Asp Thr Leu Lys Lys Glu Val
530 535


<210> 4
<211> 27
<212> PRT
<213> Burkholderia cepacia
<400> 4

Val Arg Lys Ser Thr Leu Thr Phe Leu Ile Ala Gly Cys Leu Ala Leu
1 5 10 15
Pro Gly Phe Ala Arg Ala Ala Asp Ala Ala Asp
20 25

<210> 5
<211> 28
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: primer
<400> 5

cccaagcttg ggccgatacc gatacgca

<210> 6
<211> 29
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: primer
<400> 6

gagaagcttt ccgcacggtc agacttcc

<210> 7
<211> 27
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: primer
<400> 7

catgccatgg cacacaacga caacact

<210> 8
<211> 27
<212> DNA
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: primer
<400> 8

cccaagcttg ggtcagactt ccttcttcag c

<210> 9
<211> 16
<212> PRT
<213> Burkholderia cepacia
<400> 9

Ala Asp Ala Ala Asp Pro Ala Leu Val Lys Arg Gly Glu Tyr Leu Ala
1 5 10 15

<210> 10
<211> 25
<212> PRT
<213> Artificial Sequence
<220>
<223> Description of Artificial Sequence: consensus
<220>
<221> UNSURE
<222> (6,17,18,19,22)
<223> Xaa=unknown
<400> 10

Ala Asp Ala Ala Asp Xaa Ala Leu Val Lys Arg Gly Glu Tyr Leu Ala
1 5 10 15

Xaa Xaa Xaa Asp Cys Xaa Ala Cys His
20 25

<210> 11 
<211> 2410 
<212> DNA 

<213> Burkholderia cepacia 

<220> 
<221> CDS 

<222> (673).. (1950) 
<400> 11 

galggaccac ccgggcaccg gcgtgtcgtt ctacgcgaac gagaagclgt ggccgggccg 60 

cggcccgcag gatatgacgt cgctgatcgg tttccgcgac ggcccgttcc gcgcgaccga 120 

agccgcgaag aagatccatc tgtcgaacat gtcccgcalc aaccaggaga cgcagaagat 180 

cttcaaggcc ggcaaactga tgaagcacga ggagclcgac gcgcagatcc gcgaccgttc 240 

cgcgcgctac glgcagttcg actgcttcca cgagattctg ccgcagcccg agaaccgcat 300 

cgtgccgagc aagacggcca ccgacgcgat cgggatcccg cgccccgaga tcacgtatgc 360 

gatcgacgat tacgtgaagc gcggcgccgt gcacacgcgc gaggtctacg cgacggccgc 420 

gaaggtgcig ggcggcaccg acgtcgtctt caacgacgag Itcgcgccga acaaccacat 480 
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cacgggcgcg aggatcatgg gcgcggalgc acgcgactcg glcglcgaca aggactgccg 540 
cacgtlcgac catccgaacc tgttcctctc gagcagctcg acgatgccga ccgtcggtac 600 
ggtgaacgtg acgctgacga tcgcggcgct cgcgctgcgg atgtcggaca cgctgaagaa 660 
ggaagtctga cc gtg egg aaa tct act etc acc Itc etc etc gec ggc tgc 7Jl 
Val Arg Lys Ser Thr Leu Thr Phe Leu Leu Ala Gly Cys 
I 5 10 

etc gcg ctg ccc ggc etc gca cgc gcg gee gat teg gee gat ccg gcg 759 
Leu Ala Leu Pro Gly Leu Ala Arg Ala Ala Asp Ser Ala Asp Pro Ala 

15 20 25 

cat gtc aag cgc ggc gaa tac etc gee gtc gcg ggc gac tgc atg gca 807 
His Val Lys Arg Gly Glu Tyr Leu Ala Val Ala Gly Asp Cys Met Ala 
30 35 40 45 

tgc cac acc gcg aag ggc ggc aag ccg ttc gcg ggc ggc etc ggc atg 855 
Cys His Thr Ala Lys Gly Gly Lys Pro Phe Ala Gly Gly Leu Gly Met 

50 55 60 

ccg gig ccg atg etc ggc aag ate tat acg age aac ate aca ccg gat 903 
Pro Val Pro Met Leu Gly Lys He Tyr Thr Ser Asn He Thr Pro Asp 

65 70 75 

ccc gat acc ggc ate ggc aac tgg acg ttc gag gac tic gag cgc gcg 951 
Pro Asp Thr Gly He Gly Asn Trp Thr Phe Glu Asp Phe Glu Arg Ala 

80 85 90 

gtg egg cac ggc gta teg aag aac ggc gac aac ctg tac ccg gcg atg 999 
Val Arg His Gly Val Ser Lys Asn Gly Asp Asn Leu Tyr Pro Ala Met 

95 100 105 

ccg tac gtg teg tac gcg aag ate aac gac gac gac gtg caa gcg ctg 1047 
Pro Tyr Val Ser Tyr Ala Lys He Asn Asp Asp Asp Val Gin Ala Leu 
110 115 120 125 

tac gcg tac ttc atg cac ggc gle gaa ccg gtc aag cag gcg ccg ccg 1095 
Tyr Ala Tyr Phe Met His Gly Val Glu Pro Val Lys Gin Ala Pro Pro 

130 135 140 

aag aac gag ate ccc gcg ctg ctg age atg cgc tgg ccg ctg aag ate 1143 
Lys Asn Glu He Pro Ala Leu Leu Ser Met Arg Trp Pro. Leu Lys lie 

145 150 155 

tgg aac tgg ctg ttc ctg aag gac ggc gtg tac cag ccg aag ccc gag 1191 
Trp Asn Trp Leu Phe Leu Lys Asp Gly Val Tyr Gin Pro Lys Pro Glu 

160 165 170 

cag age gec gag tgg aac cgc ggc gec tat etc gtg cag ggc etc gcg 1239 
Gin Ser Ala Glu Trp Asn Arg Gly Ala Tyr Leu Val Gin Gly Leu Ala 

175 180 185 

cac tgc age acg tgc cac acg ccg cgc ggc ate gcg atg cag gag aag 1287 
His Cys Ser Thr Cys His Thr Pro Arg Gly Ile Ala Met Gln Glu Lys
190 195 200 205

tcg ctc gac gaa acg ggc ggc agc ttc ctg tcg ggc tcg gtg ctc gcg 1335
Ser Leu Asp Glu Thr Gly Gly Ser Phe Leu Ser Gly Ser Val Leu Ala
210 215 220

ggc tgg gac ggc tac aac atc acg tcc gac ccg aac gcg ggg atc ggc 1383
Gly Trp Asp Gly Tyr Asn Ile Thr Ser Asp Pro Asn Ala Gly Ile Gly
225 230 235

ggc tgg acg cag cag cag ctc gtc cag tac ctg cgc acc ggc agc gtg 1431
Gly Trp Thr Gln Gln Gln Leu Val Gln Tyr Leu Arg Thr Gly Ser Val
240 245 250

ccg ggc ctc gcg cag gcg gcc ggc ccg atg gcc gag gcg atc gag cac 1479
Pro Gly Leu Ala Gln Ala Ala Gly Pro Met Ala Glu Ala Ile Glu His
255 260 265

agc ttc tcg aag atg acc gaa gcc gac atc ggc ggc ccg atg gcc gag 1527
Ser Phe Ser Lys Met Thr Glu Ala Asp Ile Gly Gly Pro Met Ala Glu
270 275 280 285

gcg atc gag cac agc ttc tcg aag atg acc gaa gcc gac atc ggc cgc 1575
Ala Ile Glu His Ser Phe Ser Lys Met Thr Glu Ala Asp Ile Gly Arg
290 295 300

tcg tcg tgg ggc aag ccg gcc gag gat ggc ctg aag ctg cgc ggc gtc 1623
Ser Ser Trp Gly Lys Pro Ala Glu Asp Gly Leu Lys Leu Arg Gly Val
305 310 315

gcg ctc gcg tcg tcg ggc atc gat ccg gca ccg ctg tat ctc ggc aac 1671
Ala Leu Ala Ser Ser Gly Ile Asp Pro Ala Pro Leu Tyr Leu Gly Asn
320 325 330

tgc gcg acc tgc cac cag atg cag ggc aag ggc acg ccg gac ggt tac 1719
Cys Ala Thr Cys His Gln Met Gln Gly Lys Gly Thr Pro Asp Gly Tyr
335 340 345

tac ccg ccg ttg ttc cac aac tcg acg gtc ggc gcg tcg aat ccg acc 1767
Tyr Pro Pro Leu Phe His Asn Ser Thr Val Gly Ala Ser Asn Pro Thr
350 355 360 365

aac ctc gtg cag gtg atc ctg aac ggc gtg cag cgc aag gcc ggc agc 1815
Asn Leu Val Gln Val Ile Leu Asn Gly Val Gln Arg Lys Ala Gly Ser
370 375 380

gag gac gtc ggg atg ccc gcg ttc cgc cac gag ctg tcg gat gcg cag 1863
Glu Asp Val Gly Met Pro Ala Phe Arg His Glu Leu Ser Asp Ala Gln
385 390 395

atc gcc gcg ctg acg aac tac ctg acg ggg cag ttc ggc aat ccg gcc 1911
Ile Ala Ala Leu Thr Asn Tyr Leu Thr Gly Gln Phe Gly Asn Pro Ala
400 405 410

gcg aag gtg acc gag cag gac gtc gcg aag ctg cgc tga aacgcggcac 1960
Ala Lys Val Thr Glu Gln Asp Val Ala Lys Leu Arg
415 420 425
gcggcgaggc agggcaacaa tagaaaagag gaggagcaca gcacatcggg cgggccccga 2020
tgccggttgt tgcagagcgg gacgggcggc gcaggcggtc gcccgtcctg gttcacaggc 2080
aatccggtgc gcgcacgccg cgcatcgttt tcgttgatcg agaccatgac accgaaccaa 2140
ccgtllc.kg cgtcccagcg cgatgtgctg ctgctgctgt cccgaatcct gctcgtgatc 2200
ctgttcgtga tgttcggctg gaagaagatt atcgacttct ccggtacgat cgcgttcatg 2260
ggcagcgagg gcgcgccggc gccgatcatc tcggcggcga tctccgtcgt gatggagctc 2320
atcgtcggga ttgcgatcct cgtcggtttc cagacgcggc cgctcgcgct gttgcttgcg 2380
ctgtacacga tcggtaccgg catcatcggc 2410



<210> 12
<211> 425 
<212> PRT 

<213> Burkholderia cepacia 



<400> 12 
Val Arg Lys Ser Thr Leu Thr Phe Leu Leu Ala Gly Cys Leu Ala Leu
1 5 10 15

Pro Gly Leu Ala Arg Ala Ala Asp Ser Ala Asp Pro Ala His Val Lys
20 25 30

Arg Gly Glu Tyr Leu Ala Val Ala Gly Asp Cys Met Ala Cys His Thr
35 40 45

Ala Lys Gly Gly Lys Pro Phe Ala Gly Gly Leu Gly Met Pro Val Pro
50 55 60

Met Leu Gly Lys Ile Tyr Thr Ser Asn Ile Thr Pro Asp Pro Asp Thr
65 70 75 80

Gly Ile Gly Asn Trp Thr Phe Glu Asp Phe Glu Arg Ala Val Arg His
85 90 95

Gly Val Ser Lys Asn Gly Asp Asn Leu Tyr Pro Ala Met Pro Tyr Val
100 105 110

Ser Tyr Ala Lys Ile Asn Asp Asp Asp Val Gln Ala Leu Tyr Ala Tyr
115 120 125

Phe Met His Gly Val Glu Pro Val Lys Gln Ala Pro Pro Lys Asn Glu
130 135 140

Ile Pro Ala Leu Leu Ser Met Arg Trp Pro Leu Lys Ile Trp Asn Trp
145 150 155 160

Leu Phe Leu Lys Asp Gly Val Tyr Gln Pro Lys Pro Glu Gln Ser Ala
165 170 175

Glu Trp Asn Arg Gly Ala Tyr Leu Val Gln Gly Leu Ala His Cys Ser
180 185 190

Thr Cys His Thr Pro Arg Gly Ile Ala Met Gln Glu Lys Ser Leu Asp
195 200 205

Glu Thr Gly Gly Ser Phe Leu Ser Gly Ser Val Leu Ala Gly Trp Asp
210 215 220

Gly Tyr Asn Ile Thr Ser Asp Pro Asn Ala Gly Ile Gly Gly Trp Thr
225 230 235 240

Gln Gln Gln Leu Val Gln Tyr Leu Arg Thr Gly Ser Val Pro Gly Leu
245 250 255

Ala Gln Ala Ala Gly Pro Met Ala Glu Ala Ile Glu His Ser Phe Ser
260 265 270

Lys Met Thr Glu Ala Asp Ile Gly Gly Pro Met Ala Glu Ala Ile Glu
275 280 285

His Ser Phe Ser Lys Met Thr Glu Ala Asp Ile Gly Arg Ser Ser Trp
290 295 300

Gly Lys Pro Ala Glu Asp Gly Leu Lys Leu Arg Gly Val Ala Leu Ala
305 310 315 320

Ser Ser Gly Ile Asp Pro Ala Pro Leu Tyr Leu Gly Asn Cys Ala Thr
325 330 335

Cys His Gln Met Gln Gly Lys Gly Thr Pro Asp Gly Tyr Tyr Pro Pro
340 345 350

Leu Phe His Asn Ser Thr Val Gly Ala Ser Asn Pro Thr Asn Leu Val
355 360 365

Gln Val Ile Leu Asn Gly Val Gln Arg Lys Ala Gly Ser Glu Asp Val
370 375 380

Gly Met Pro Ala Phe Arg His Glu Leu Ser Asp Ala Gln Ile Ala Ala
385 390 395 400

Leu Thr Asn Tyr Leu Thr Gly Gln Phe Gly Asn Pro Ala Ala Lys Val
405 410 415

Thr Glu Gln Asp Val Ala Lys Leu Arg
420 425




<210> 13 
<211> 27 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: primer 
<400> 13 

tgcaccgtgc ggaaatctac tctcact

<210> 14 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 14

acttccttct tcagcgtgtc cgacatc 27

<210> 15 
<211> 1441 
<212> DNA 

<213> Burkholderia cepacia

<220> 
<221> CDS 

<222> (121).. (1398) 
<400> 15 

tccgaacctg ttcatttcga gcagcgcgac gatgccgacc gtcggtaccg taaacgtgac 60 
gctgacgatc gccgcgctcg cgctgcggat gtcggacacg ctgaagaagg aagtctgacc 120 
gtg cgg aaa tct act ctc act ttc ctc atc gcc ggc tgc ctc gcg ttg 168
Val Arg Lys Ser Thr Leu Thr Phe Leu Ile Ala Gly Cys Leu Ala Leu
1 5 10 15

ccg ggc ttc gcg cgc gcg gcc gat gcg gcc gat ccg gcg ctg gtc aag 216
Pro Gly Phe Ala Arg Ala Ala Asp Ala Ala Asp Pro Ala Leu Val Lys
20 25 30

cgc ggc gaa tac ctc gcg acc gcc atg ccg gta ccg atg ctc ggc aag 264
Arg Gly Glu Tyr Leu Ala Thr Ala Met Pro Val Pro Met Leu Gly Lys
35 40 45

atc tac acg agc aac atc acg ccc gat ccc gat acg ggc gac tgc atg 312
Ile Tyr Thr Ser Asn Ile Thr Pro Asp Pro Asp Thr Gly Asp Cys Met
50 55 60

gcc tgc cac acc gtg aag ggc ggc aag ccg tac gcg ggc ggc ctt ggc 360
Ala Cys His Thr Val Lys Gly Gly Lys Pro Tyr Ala Gly Gly Leu Gly
65 70 75 80

ggc atc ggc aaa tgg acg ttc gag gac ttc gag cgc gcg gtg cgg cac 408
Gly Ile Gly Lys Trp Thr Phe Glu Asp Phe Glu Arg Ala Val Arg His
85 90 95

ggc gtg tcg aag aac ggc gac aac ctg tat ccg gcg atg ccg tac gtg 456
Gly Val Ser Lys Asn Gly Asp Asn Leu Tyr Pro Ala Met Pro Tyr Val
100 105 110

tcg tac gcg aag atc aag gac gac gac gta cgc gcg ctg tac gcc tac 504
Ser Tyr Ala Lys Ile Lys Asp Asp Asp Val Arg Ala Leu Tyr Ala Tyr
115 120 125

ttc atg cac ggc gtc gag ccg gtc aag cag gcg ccg ccg aag aac gag 552
Phe Met His Gly Val Glu Pro Val Lys Gln Ala Pro Pro Lys Asn Glu
130 135 140

atc cca gcg ctg cta agc atg cgc tgg ccg ctg aag atc tgg aac tgg 600
Ile Pro Ala Leu Leu Ser Met Arg Trp Pro Leu Lys Ile Trp Asn Trp
145 150 155 160

ctg ttc ctg aag gac ggc ccg tac cag ccg aag ccg tcg cag agc gcc 648
Leu Phe Leu Lys Asp Gly Pro Tyr Gln Pro Lys Pro Ser Gln Ser Ala
165 170 175

gaa tgg aat cgc ggc gcg tat ctg gtg cag ggt ctc gcg cac tgc agc 696
Glu Trp Asn Arg Gly Ala Tyr Leu Val Gln Gly Leu Ala His Cys Ser
180 185 190

acg tgc cac acg ccg cgc ggc atc gcg atg cag gag aag tcg ctc gac 744
Thr Cys His Thr Pro Arg Gly Ile Ala Met Gln Glu Lys Ser Leu Asp
195 200 205

gaa acc ggc ggc agc ttc ctc gcg ggg tcg gtg ctc gcc ggc tgg gac 792
Glu Thr Gly Gly Ser Phe Leu Ala Gly Ser Val Leu Ala Gly Trp Asp
210 215 220

ggc tac aac atc acg tcg gac ccg aat gcg ggg atc ggc agc tgg acg 840
Gly Tyr Asn Ile Thr Ser Asp Pro Asn Ala Gly Ile Gly Ser Trp Thr
225 230 235 240

cag cag cag ctc gtg cag tat ttg cgc acc ggc agc gtg ccg ggc gtc 888
Gln Gln Gln Leu Val Gln Tyr Leu Arg Thr Gly Ser Val Pro Gly Val
245 250 255

gcg cag gcg gcc ggg ccg atg gcc gag gcg gtc gag cac agc ttc tcg 936
Ala Gln Ala Ala Gly Pro Met Ala Glu Ala Val Glu His Ser Phe Ser
260 265 270

aag atg acc gaa gcg gac atc ggt gcg atc gcc acg tac gtc cgc acg 984
Lys Met Thr Glu Ala Asp Ile Gly Ala Ile Ala Thr Tyr Val Arg Thr
275 280 285

gtg ccg gcc gtt gcc gac agc aac gcg aag cag ccg cgg tcg tcg tgg 1032
Val Pro Ala Val Ala Asp Ser Asn Ala Lys Gln Pro Arg Ser Ser Trp
290 295 300

ggc aag ccg gcc gag gac ggg ctg aag ctg cgc ggt gtc gcg ctc gcg 1080
Gly Lys Pro Ala Glu Asp Gly Leu Lys Leu Arg Gly Val Ala Leu Ala
305 310 315 320

tcg tcg ggc atc gat ccg gcg cgg ctg tat ctc ggc aac tgc gcg acg 1128
Ser Ser Gly Ile Asp Pro Ala Arg Leu Tyr Leu Gly Asn Cys Ala Thr
325 330 335

tgc cac cag atg cag ggc aag ggc acg ccg gac ggc tat tac ccg tcg 1176
Cys His Gln Met Gln Gly Lys Gly Thr Pro Asp Gly Tyr Tyr Pro Ser
340 345 350

ctg ttc cac aac tcc acc gtc ggc gcg tcg aat ccg tcg aac ctc gtg 1224
Leu Phe His Asn Ser Thr Val Gly Ala Ser Asn Pro Ser Asn Leu Val
355 360 365

cag gtg atc ctg aac ggc gtg cag cgc aag atc ggc agc gag gat atc 1272
Gln Val Ile Leu Asn Gly Val Gln Arg Lys Ile Gly Ser Glu Asp Ile
370 375 380

ggg atg ccc gct ttc cgc tac gat ctg aac gac gcg cag atc gcc gcg 1320
Gly Met Pro Ala Phe Arg Tyr Asp Leu Asn Asp Ala Gln Ile Ala Ala
385 390 395 400

ctg acg aac tac gtg acc gcg cag ttc ggc aat ccg gcg gcg aag gtg 1368
Leu Thr Asn Tyr Val Thr Ala Gln Phe Gly Asn Pro Ala Ala Lys Val
405 410 415

acg gag cag gac gtc gcg aag ctg cgc tga catagtcggg cgcgccgaca 1418
Thr Glu Gln Asp Val Ala Lys Leu Arg
420 425
cggcgcaacc gataggacag gag 1441



<210> 16 
<211> 425
<212> PRT

<213> Burkholderia cepacia
<400> 16

Val Arg Lys Ser Thr Leu Thr Phe Leu Ile Ala Gly Cys Leu Ala Leu
1 5 10 15

Pro Gly Phe Ala Arg Ala Ala Asp Ala Ala Asp Pro Ala Leu Val Lys
20 25 30

Arg Gly Glu Tyr Leu Ala Thr Ala Met Pro Val Pro Met Leu Gly Lys
35 40 45

Ile Tyr Thr Ser Asn Ile Thr Pro Asp Pro Asp Thr Gly Asp Cys Met
50 55 60

Ala Cys His Thr Val Lys Gly Gly Lys Pro Tyr Ala Gly Gly Leu Gly
65 70 75 80

Gly Ile Gly Lys Trp Thr Phe Glu Asp Phe Glu Arg Ala Val Arg His
85 90 95

Gly Val Ser Lys Asn Gly Asp Asn Leu Tyr Pro Ala Met Pro Tyr Val
100 105 110

Ser Tyr Ala Lys Ile Lys Asp Asp Asp Val Arg Ala Leu Tyr Ala Tyr
115 120 125

Phe Met His Gly Val Glu Pro Val Lys Gln Ala Pro Pro Lys Asn Glu
130 135 140

Ile Pro Ala Leu Leu Ser Met Arg Trp Pro Leu Lys Ile Trp Asn Trp
145 150 155 160

Leu Phe Leu Lys Asp Gly Pro Tyr Gln Pro Lys Pro Ser Gln Ser Ala
165 170 175

Glu Trp Asn Arg Gly Ala Tyr Leu Val Gln Gly Leu Ala His Cys Ser
180 185 190

Thr Cys His Thr Pro Arg Gly Ile Ala Met Gln Glu Lys Ser Leu Asp
195 200 205

Glu Thr Gly Gly Ser Phe Leu Ala Gly Ser Val Leu Ala Gly Trp Asp
210 215 220

Gly Tyr Asn Ile Thr Ser Asp Pro Asn Ala Gly Ile Gly Ser Trp Thr
225 230 235 240

Gln Gln Gln Leu Val Gln Tyr Leu Arg Thr Gly Ser Val Pro Gly Val
245 250 255

Ala Gln Ala Ala Gly Pro Met Ala Glu Ala Val Glu His Ser Phe Ser
260 265 270

Lys Met Thr Glu Ala Asp Ile Gly Ala Ile Ala Thr Tyr Val Arg Thr
275 280 285

Val Pro Ala Val Ala Asp Ser Asn Ala Lys Gln Pro Arg Ser Ser Trp
290 295 300

Gly Lys Pro Ala Glu Asp Gly Leu Lys Leu Arg Gly Val Ala Leu Ala
305 310 315 320

Ser Ser Gly Ile Asp Pro Ala Arg Leu Tyr Leu Gly Asn Cys Ala Thr
325 330 335

Cys His Gln Met Gln Gly Lys Gly Thr Pro Asp Gly Tyr Tyr Pro Ser
340 345 350

Leu Phe His Asn Ser Thr Val Gly Ala Ser Asn Pro Ser Asn Leu Val
355 360 365

Gln Val Ile Leu Asn Gly Val Gln Arg Lys Ile Gly Ser Glu Asp Ile
370 375 380

Gly Met Pro Ala Phe Arg Tyr Asp Leu Asn Asp Ala Gln Ile Ala Ala
385 390 395 400

Leu Thr Asn Tyr Val Thr Ala Gln Phe Gly Asn Pro Ala Ala Lys Val
405 410 415

Thr Glu Gln Asp Val Ala Lys Leu Arg
420 425




<210> 17 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: heme binding motif 



<220> 

<221> UNSURE 
<222> (2,3) 
<223> Xaa=unknown 






<400> 17 

Cys Xaa Xaa Cys His 
1 5 
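
The heme binding motif of SEQ ID NO: 17 (Cys Xaa Xaa Cys His, Xaa being any residue) can be located in a listed protein sequence with a simple pattern search. The following is only an illustrative sketch and not part of the listing as filed: the function name `find_heme_motifs` and the parameter `first_residue` are hypothetical, and the test fragment is residues 61 to 70 as they read in SEQ ID NO: 16.

```python
import re

# Standard three-letter to one-letter amino acid codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}

# Cys-Xaa-Xaa-Cys-His with Xaa = any residue (SEQ ID NO: 17).
# The lookahead lets overlapping occurrences be reported as well.
HEME_MOTIF = re.compile(r"(?=(C..CH))")


def find_heme_motifs(three_letter_seq, first_residue=1):
    """Return (residue number, motif) pairs for each Cys-Xaa-Xaa-Cys-His hit.

    three_letter_seq: whitespace-separated three-letter residue codes.
    first_residue: number of the first residue in the fragment (hypothetical
    convenience parameter so that hits keep the numbering of the listing).
    """
    one_letter = "".join(THREE_TO_ONE[res] for res in three_letter_seq.split())
    return [(m.start() + first_residue, m.group(1))
            for m in HEME_MOTIF.finditer(one_letter)]


# Residues 61 to 70 of SEQ ID NO: 16.
fragment = "Gly Asp Cys Met Ala Cys His Thr Val Lys"
print(find_heme_motifs(fragment, first_residue=61))
```

Applied to a full sequence with `first_residue=1`, the same function reports the residue number at which each motif starts.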

<210> 18 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 18 

catgccatgg cacacaacga caacact

<210> 19 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 



<400> 19 

cccaagcttg ggtcagactt ccttcttcag c 



Claims 

1. A protein defined in the following (A) or (B):

(A) a protein which has at least the amino acid sequence comprising amino acids 23 to 425 of SEQ ID NO: 16;
(B) a protein which has at least the amino acid sequence comprising amino acids 23 to 425 of SEQ ID NO:
16 including substitution, deletion, insertion or addition of 1 to 20 amino acid residues.

2. A DNA encoding a protein defined in the following (A) or (B):

(A) a protein which has at least the amino acid sequence comprising amino acids 23 to 425 of SEQ ID NO: 16;
(B) a protein which has at least the amino acid sequence comprising amino acids 23 to 425 of SEQ ID NO:
16 including substitution, deletion, insertion or addition of 1 to 20 amino acid residues.

3. The DNA according to claim 2, wherein the DNA is defined in the following (a) or (b):

(a) a DNA comprising the nucleotide sequence consisting of nucleotides 187 to 1398 of SEQ ID NO: 15;

(b) a DNA which is hybridizable with the nucleotide sequence consisting of nucleotides 187 to 1398 of SEQ
ID NO: 15 under stringent conditions.

4. The DNA according to claim 3, further comprising the nucleotide sequence consisting of nucleotides 121 to 187
of SEQ ID NO: 15.

5. A recombinant vector comprising the DNA according to any one of claims 2 to 4.

6. A transformant transformed with the DNA according to any one of claims 2 to 4 or the recombinant vector according
to claim 5.

7. A method of producing a glucose dehydrogenase β subunit, comprising culturing the transformant according to
claim 6 to produce a glucose dehydrogenase β subunit as an expression product of the DNA, and collecting the
produced β subunit.

8. The DNA according to claim 3 or 4, further comprising the nucleotide sequence encoding an α subunit and a γ
subunit of glucose dehydrogenase of Burkholderia cepacia.

9. A recombinant vector comprising the DNA according to claim 8.

10. A transformant transformed with the DNA according to claim 8 or the recombinant vector according to claim 9.

11. A method of producing a glucose dehydrogenase complex, comprising culturing the transformant according to
claim 10 to produce a glucose dehydrogenase complex as an expression product of the DNA, and collecting the
produced complex.
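
The nucleotide ranges and amino acid ranges recited in the claims are related through the CDS feature of SEQ ID NO: 15, which is annotated as (121)..(1398). The following sketch is illustrative only and is not part of the claims: the helper names are hypothetical, and it assumes the reading frame starts at nucleotide 121 with no gaps. It converts between residue numbers and nucleotide positions.

```python
# CDS coordinates taken from the <222> feature of SEQ ID NO: 15.
CDS_START = 121
CDS_END = 1398


def residue_to_nucleotides(residue, cds_start=CDS_START):
    """Return the (first, last) nucleotide positions of the codon for a residue."""
    first = cds_start + 3 * (residue - 1)
    return first, first + 2


def nucleotide_to_residue(nt, cds_start=CDS_START):
    """Return the number of the codon (1-based) that contains nucleotide nt."""
    return (nt - cds_start) // 3 + 1


# Residue 23 starts at nucleotide 187, the lower bound recited in claim 3.
print(residue_to_nucleotides(23))
# Residue 425 is the last encoded residue; the next codon is the stop codon.
print(residue_to_nucleotides(425))
# Number of codons in the CDS, including the stop codon.
print((CDS_END - CDS_START + 1) // 3)
```

Under these assumptions the codon of residue 425 ends at nucleotide 1395, so the range 187 to 1398 covers residues 23 to 425 together with the stop codon at 1396 to 1398.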